Distinct distributions of genomic features of the 5’ and 3’ partners of coding somatic cancer gene fusions: arising mechanisms and functional implications

Authors

  • Yongzhong Zhao
  • Won-Min Song
  • Fan Zhang
  • Ming-Ming Zhou
  • Weijia Zhang
  • Martin J. Walsh
  • Bin Zhang
Abstract

The genomic features and arising mechanisms of coding cancer somatic gene fusions (CSGFs) remain largely elusive. In this study, we show a gene-origin stratification pattern of CSGF partners: fusion partners in human cancers are significantly enriched for genes with a gene age of Euteleostomes and a gene family age of Bilateria. GC skew, a measure of G versus C nucleotide content bias defined as (G-C)/(G+C), is useful for indicating the DNA leading strand, lagging strand, replication origin, and replication terminus, as well as DNA-RNA R-loop formation. We find GC skew bias at the 5 prime (5') but not the 3 prime (3') partners of CSGFs, coinciding with a polarity in gene expression breadth: 5' partners are generally more ubiquitously expressed, whereas 3' partners are more tissue specific. We reveal distinct length and composition distributions of the 5' and 3' partners of CSGFs, including sequence features corresponding to the 5' untranslated regions (UTRs), 3' UTRs, and the N-terminal sequences of the encoded proteins. Oncogenic somatic gene fusions are most enriched for somatic amplification of both the 5' and 3' genes, alongside a substantial proportion of other types of combinations. At the functional level, 5' partners of CSGFs appear more likely to be tumour suppressor genes, while many 3' partners appear to be proto-oncogenes. Such distinct polarities of CSGFs at the evolutionary, structural, genomic and functional levels indicate heterogeneous arising mechanisms of CSGFs, including R-loops, and suggest potential novel targeted therapeutics specific to CSGF functional categories.
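The GC skew definition quoted in the abstract, (G-C)/(G+C), can be illustrated with a minimal Python sketch. The function names `gc_skew` and `windowed_gc_skew`, the sliding-window scheme, and the convention of returning 0.0 for a sequence with no G or C are assumptions made for illustration only; the abstract does not describe how the authors computed the statistic.

```python
from typing import List


def gc_skew(seq: str) -> float:
    """Return (G - C) / (G + C) for one sequence.

    Returning 0.0 when the sequence has no G or C is an illustrative
    convention, not something taken from the paper.
    """
    s = seq.upper()
    g = s.count("G")
    c = s.count("C")
    total = g + c
    return (g - c) / total if total else 0.0


def windowed_gc_skew(seq: str, window: int, step: int) -> List[float]:
    """GC skew of each full-length window along the sequence.

    The window and step sizes are hypothetical parameters.
    """
    last_start = len(seq) - window
    return [gc_skew(seq[i:i + window]) for i in range(0, last_start + 1, step)]
```

A positive value means G outnumbers C on the strand examined, and a negative value means the reverse; scanning windows along a gene is one simple way to see where the bias changes sign.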

Similar resources

I-11: Dedifferentiation of Mouse Fibroblast Cells by Chemical Induction

Induced pluripotent stem cells (iPSCs) generated by ectopic expression of four transcription factors hold great promise for regenerative medicine in humans. Since the initial report of iPSCs by viral transfection, ample efforts have been made in the generation of iPSCs through nonviral approaches. Small molecules offer the advantages of low cost without genomic modification and have been used ...

Long non-coding RNAs and their significance in human diseases

Protein-coding genes account for only a small fraction of the human genome and most of the genomic sequences are transcriptionally silent, but recent observations indicate significant functional elements, including non-coding protein transcripts in the human genome. Long non-coding RNAs (lncRNAs) have been defined as transcripts of >200 nucleotides without protein-coding capacity that perform t...

The Role of Matrix Metalloproteinase-3 Functional 5A/6A Promoter Polymorphism in Tumor Cell Progression and Metastasis of Breast Cancer

In the human genome, chromosome 11 contains a cluster of matrix metalloproteinase (MMP) genes. Single nucleotide polymorphisms in the promoter region of MMP genes are important for MMP expression. A common adenine deletion polymorphism (5A) at position -1171 of the MMP-3 gene promoter (5´-AAAAAACCAT-3´ change to 5´-AAAAACCAT-3´) facilitates transcriptional factor binding and MMP-3 promoter acti...

Detection of Somatic Mutation in Exon 12 of DNA Polymerase β in Ovarian Cancer Tissue Samples

Background: DNA polymerase β (pol β) is a key enzyme of base excision repair pathway. It is a 1-kb gene consisting of 14 exons. Its catalytic part lies between exon 8 and exon 14. Exon 12 has a role in deoxyribonucleotide triphosphate selection for nucleotide transferase activity. Methods: Genomic DNA was isolated from ovarian carcinoma samples. Single strand conformation polymorphism...

Linkage between Large intergenic non-coding RNA regulator of reprogramming and Stemness State in Samples with Helicobacter pylori Infection of Gastric Cancer Cells

Background: Long noncoding RNAs (lncRNAs), as non-protein coding transcripts, play key roles in tumor progression and stemness state in many malignancies, and their aberrant expression has been found in gastric cancer (GC), one of the most common cancers worldwide. LINC-ROR (large intergenic noncoding RNA regulator of reprogramming) has been identified as an lncRNA involved in human malignancies, howeve...

Journal title:

Volume 8, Issue

Pages -

Publication date: 2017